Biped Balance Control by Reinforcement Learning
نویسندگان
چکیده
This work studied biped walking with single (one-leg) support and balance control using reinforcement learning. The proposed Q-learning algorithm makes a robot learn to walk without any previous knowledge of dynamics model. This balance control with single support shifts the Zero Moment Point (ZMP) of the robot to a stable region over walking sequences by means of learned gestures. Hence, the proposed method could be applied to biped walking on either plain or sloping surfaces with the help of sensory inputs. The reinforcement learning mechanism was used as the position control of the robot joints. While the robot was walking, it continuously adjusted its gaits via learning and finally formed gaits that were as stable as when it walked on the level surface. After the robot had learned to walk on an even terrain, it could learn to climb an inclined surface faster using its newly acquired knowledge. Experiments of biped walking on an even terrain and a seesaw were performed to show the validity of the proposed reinforcement learning mechanism.
منابع مشابه
Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning
This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots focussed on policy gradient reinforcement learning with fuzzy evaluative feedback . The proposed structure of controller involves two feedback loops: conventional computed torque controller including impact-force controller and reinforcement learning computed torque controller. Reinforcement learnin...
متن کاملEpisodic Reinforcement Learning Control Approach for Biped Walking
This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...
متن کاملRobust Movement Policies for Physics-based Characters in 2-D
This course project outlines an approach for constructing simple, relatively robust control strategies for a simple, physics-based biped character. A simple, articulated character was first built from links in the BOX2D simulator, and tested during each phase of creation. These phases consisted of specifying torques for each joint of the character, creating Proportional Derivative (PD) controll...
متن کاملReinforcement Learning for a CPG-driven Biped Robot
Animal’s rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs). This article presents a reinforcement learning (RL) method for a CPG controller, which is inspired by the control mechanism of animals. Because the CPG controller is an instance of recurrent neural networks, a naive application of RL involves difficulties. ...
متن کاملPoincaré-Map-Based Reinforcement Learning For Biped Walking
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Viapoints are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Inf. Sci. Eng.
دوره 32 شماره
صفحات -
تاریخ انتشار 2016